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INHIBITION OF NON-CD4 MEDIATED HTV INFECTION 

TECHNICAL FIELD' OF TFTF INVENTION 

5 

The present invention is directed to a non-CD4 cell surface receptor for gpl20. 
This gpl20 receptor (gpl20r) has been isolated and cloned and is utilized in the present 
invention in methods and kits for the inhibition and detection of HTV infection. 

10 BACKGROUND OF THE I NVENTION 

Two types of human retroviruses have been identified, leukemia viruses and AIDS- 
related viruses. The primary targets of the human retroviruses are T lymphocytes and cells 
of the central nervous system. All human retroviruses are transmitted by intimate contact, 

15 blood contamination, and infection in utero or after birth by milk. It is likely that all 
human retroviruses originated in Africa and that they encountered the human species via 
interspecies infection, possibly from African green monkeys or a related species. The 
human retroviruses first discovered, Human T Lymphotropic Virus Type 1 (HTLV-1) and 
Human T Lymphotropic Virus Type II (HTLV-H), have a preferential tropism for T4 cells 

20 and some T8 cells, share significant sequence homology, and are mainly associated with T 
cell leukemias and lymphomas. The other group of human retroviruses, generally called 
Human Immunodeficiency Viruses (HIV), is discussed in greater detail below. There are 
two major differences between the two types of human retroviruses: (1) there is substantial 
genomic variability among various HTV isolates, whereas the genomes of HTLV-I and 

25 HTLV-n are stable; and (2) HTV entered human populations much more recently than 
HTLV-I or HTLV-n. 

The human immunodeficiency virus (HTV) is a cytopathic retrovirus and the 
causative agent of the acquired immunodeficiency syndrome (AIDS). Two forms of HTV 
have now been identified. The prototype virus, HTV-1, previously termed 

30 lymphadenopathy-associated virus (LAV) and Human T Lymphotropic Virus Type HI 
(HTLV-IH), is responsible for the vast majority of reported AIDS cases worldwide. 
Another retrovirus, HTV-2, has been isolated primarily from West African patients with 
AIDS and is pathogenically related to HTV-1 . On the genetic level, HTV-2 is actually more 
closely related to the simian immunodeficiency virus (STV), a retrovirus infecting 

35 monkeys. 

Over half of the people that have contracted AIDS in the United States have already 
died. As many as three million persons in this country, may be asymptomatic carriers of 
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HIV and are capable of transmitting the virus. It had been estimated in 1986 that 270,000 
cases of AIDS will have occurred in the United States by 1991 (U.S. Public Health 
Service, (1986), Public Health Rep. lfll:341). The mortality rate from AIDS is 
disturbingly high, exceeding 80% within three years of diagnosis and possibly reaching 
5 100% over a longer period. 

Worldwide, the AIDS epidemic may involve some five to ten million presently 
infected persons. Particularly troublesome are statistics from the African continent where 
millions of individuals are believed infected with HIV, deaths range in the hundreds of 
thousands, and heterosexual transmission predominates. To date, there is neither a known 

10 cure for AIDS nor an effective vaccine against HIV infection. 

HIV is a member of the nontransforming, cytopathic lentivirus family of 
retroviruses. HIV causes a typically fatal disease characterized by severe 
immunodeficiency or neurodegenerative disease, or both. The primary basis for HIV 
induced immunosuppression is the depletion of the helper/inducer subset of T lymphocytes 

15 expressing the CD4 molecule (T4 or CD4 + cells), which serves as a high affinity cell 
surface receptor for the virus. T4 lymphocytes are involved directly or indirectly in the 
induction of nearly every immunologic function in the body, and their depletion results in 
susceptibility to a wide range of opportunistic infections and neoplasms. 

In addition to the T4 lymphocyte, other cells expressing the CD4 molecule are 

20 targets of HIV infection, especially monocyte-macrophages. HIV infection also results in 
serious B cell abnormalities including polyclonal activation, hypergammaglobulinemia, 
elevated levels of circulating immune complexes, and autoantibodies. A decreased number 
of functional natural killer (NK) cells have also been observed in AIDS patients. 

Infection of CD4 + cells is initiated by the interaction of the CD4 molecule with the 

25 major HIV envelope glycoprotein gpl20, an event which is followed by internalization and 
uncoating of the virion, transcription of genomic KNA to DNA by virus-encoded reverse 
transcriptase, and integration of the resulting proviral DNA into host cell chromosomal 
DNA. Also, unintegrated proviral DNA accumulates in large amounts within infected cells 
and is probably a significant factor in HIV cytopathology (Shaw et aL, (1984) Science 

30 22fi:1165). 

The depletion of CD4 + T cells appears to contribute significantly to the 
immunosuppression associated with AIDS. A primary cytopathic effect of the virus in 
vitro is HTV-induced syncytium formation. CD4, through its interaction with gpl20 plays 
an important role in syncytium formation. However, it has been observed that molecules 
35 on the cell surface of uninfected cells other than CD4 are also involved in EHV-induced 
cell fusion (ffildreth et al. (1989) Science 244:1075-1078). 
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Infection by HIV produces, in addition to AIDS, a set of neuropsychiatric disorders 
which are called the AIDS dementia complex (ADC) (Price et al., (1988X222:586-592). 

The symptoms of ADC include cognitive impairment, apathy and motor 
dysfunctions, and may affect as many as 90% of AIDS victims. The underlying cause of 
5 ADC appears to be the death of brain cells and HTV-1 can be isolated from the brains of 
infected individuals (Ho et al, (1987) N. Eng. J. Med. 212:278-286). 

An early study suggested that the cellular attachment site for HIV in brain might be 
CD4 (Pert et al., (1986) Proc. Natl. Acad. Sci. USA 22:9254-9258) but attempts to 
replicate these findings were not successful (Kozlowski et al., (1989) Neurosci. Abstr. 
to 15:671). It now appears unlikely that the CD4 antigen is involved in the infection of 
brain-derived cells by HIV. Susceptibility of brain cells to infection with HTV-1 does not 
correlate with the level of expression of CD4 (Chang-Mayer et al., (1987) Proc. Natl. 
Acad. Sci. USA 21:3526-3530; Srinivasan et al., (1988) Arch. Virol. 25:135-141), and 
infection of brain-derived cells by HTV-1 is not blocked by anti-CD4 antibodies (Clapham 
is et al., (1989) Nature 222:368-370; Ii et al., (1990) J. Virol. 64:1383-1387). 

The present invention demonstrates the presence of a non-CD4 receptor for gpl20 
and a method for the inhibition of HTV infection of cells such as brain and muscle which 
do not express high levels of CD4. 

20 STTMMARY OF THE INVENTION 

Many cells that are susceptible to HIV infection appear to bind gpl20 through a 
non-CD4 surface protein. The present invention has identified this non-CD4 gpl20 
receptor (gpl20r) and has recombinantly expressed and characterized gpl20r. 

25 In this invention a specific non-CD4 gpl20r has been isolated which has specific 

binding activity for gpl20 present on Human Immunodeficiency Virus-1 (HIV). This 
gpl20r has a molecular weight of about 45, 000 daltons, contains about 400 amino acid 
residues and is characterized by a Kd for gpl20 of about 1.3 nM to about 2.0 nM. The 
binding of gpl20 to gpl20r is inhibited by specific carbohydrates, such as mannose and 

30 fucose, plant lectins such as concanavalin A and specific antibiotics, such as pradimicin A. 

In one embodiment of the present invention, a cDNA molecule that transcribes an 
mRNA encoding for gpl20r is cloned and expressed to produce gpl20r. The DNA is 
selected from a gene library obtained from tissue such as placenta, brain, muscle and 
colon. 

35 A method of inhibiting HTV infection of mammalian cells, such as brain, muscle 

and neural cells, is contemplated by the present invention. In this method, cells are 
contacted with an effective amount of an appropriate inhibitor of gpl20r binding for a time 
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period sufficient to significantly inhibit the binding of HIV to the non-CD4 protein, 
gpl20r. Specific inhibitors of gpl20r binding include mannose carbohydrates, fucose 
carbohydrates, plant lectins, and antibiotics such as pradimicin A. 

The gpl20r of the present invention can also be utilized in a method and a kit for 
the detection of the presence ofHIVinafluid sample. In this method, the binding of HTV 
to gpl20r is detected by an indicating means such as a labelled antibody capable of binding 
to the HIV-gpl20r reaction product. It is also contemplated that the gpl20r can be affixed 
to a solid matrix to form a solid support that is useful in this method and/or kit. 

Tre flra iPTTQKT "F THE FIGURES 

In the drawings: 

FIGURE 1 illustrates expression cloning of the gpl20r cDNA and comparison to 

CD4 

~ A- Autoradiography of gpl20 binding to gpl20r and CD4 expressed in COS 
cells. A-F [^IJvgpUO; A, gpl20r; B, gpl20r with G17-2; C, gpl20r with 
200 nM unlabeled bgpl20; D, CD4; E, CD4 with G17-2; F, CD4 with 
bgpl20. G-L [^HngpUO; G, gpl02r, H, gpl20r with 110.1; I, gpl20r 
with bgpl20; J, CD4: k, GD4 with 110.1; L, CD4 with bgpl20. 
B: Inhibition of [ 125 I]vgpl20 binding to gpl20r and CD4. A-F gpl20r and G- 
L CD4. A+G, fflV antisera (1:20; Trimar); B+H, D-galactose (100 mM); 
C+I, D-mannose (100 mM); D+J, L-fiicose (100 mM); E+K, 
Concanavalin A (1 mg/nu); F+L, pradimicin A (100 fig/ml). 
C: gpl20r binding of HIV. A, HTV; B, HIV with 200 nM bgpl20. 

FIGURE 2 illustrates the characterization of the gpl20r. 

A: Scatchard analysis of [^Ugp^O binding. A - A, vgpl20 binding to 
placenta, Kd L3 nM, 19 fmol/mg protein; ■ with ftg/ml G17-2; • - 
•, vgpl20 binding to gpl20r COS cells, Kd 1.7 nM, B^ax 150,000 
receptors/cell (R/Q; O, ngpl20, Kd 1.8 nM, 149,000 R/C. 

B: Inhibition of [^IJgpUO binding to gpl20r COS cells. Open symbols 
ngpl20, filled symbols vgpl20. The relative values were the same with 
both forms of gpl20. Mannan expressed as mg/ml. □, mannan QC50 6 
/rg/ml); •, L-fucose (K ' . 6 mM); A, a-methyl D-mannoside (K 15 mM), 
O, D-mannose (K 23 mM); O, N-acetylglu(»samine (Kj 70 mM), ■, 
EGTA (K j 0.3 mM). 
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C: Internalization of gpl20 by gpl20r COS cells. Points represent the mean of 
two experiments with vgpl20 and ngpl20. • - •, suface; O - O internal. 

D: Placenta control sera; 2, placenta HIV sera; 3, gpl20r COS control sera; 4, 
gp!20r COS HIV sera. 

E: Northern blot of gp!20r expression. Polyadenylated (A + ); 2, placenta; 3, 
thymus; 4+12, forebrain; 5, skeletal muscle; 6, heart; 7, liver; 8, kidney; 
9, colon; 10 medulla; 11, cerebellum; 13, T cell (OEM; 16 jig A+) 14, B 
cell (TS-1; 16 jig A+); 15, macrophage (U937; 8 jig A+); 16, cervical 
carcinoma (HeLa; 16 /xg A + ). Hie different apparent size of the '5 kb 
band is an artifact of displacement by 28S rRNA. 

FIGURE 3 illustrates the sequence analysis of the gpl20r. 

A: Nucleotide and deduced protein sequence of gpl20r cDNA. 

B: Hydropathicity plot of the gpl20r. The predicted transmembrane segment 

and the start of the eight amphipathic repeats are indicated by arrows. 
C: Aminoacid alignment of the gpl2Qr C-type lectin domain. 

DESCRIPTION QF PREFERRED gMBQDIME^TS 

HIV infection of brain and muscle cell lines is not blocked by soluble CD4 or anti- 
CD4 antibodies (Clapham, P.R. et al., (1989) Nature 222:368-370; Harouse, J.M. et aL, 
(1989) J. Virol. ^2:2527-2533; Weber, J. et al., (1989) J. Gen. Virol. 7Q:2653-2660). 
This is consistent with the existence of a second gpl20 receptor. Binding studies indicated 
that human placenta was another source for a non-CD4 gpl20 receptor, and a cDNA for a 
second gpl20 receptor (gpl20r) was isolated by the present invention from a placental 
library. The gpl20r has a higher binding affinity for gpl20 than CD4. Sequence analysis 
revealed homology to membrane associated C-type lectins, and inhibition studies have 
shown that the receptor binds gpl20 through a mannose or fucose containing carbohydrate. 
The gpl20r rapidly internalizes gpl20, and is expressed in placenta, thymus, muscle, and 
colon. These results, when considered with previous studies on the role of gpl20 
carbohydrate in HIV infection (Lifson, J. et al., (1986) J. Exp. Med. 164:2101-2106; 
Ezekowitz, R.A.B. et al., (1989) J. Exp. Med. lfg: 185-196; Larkin M. et al., (1989) 
AIDS 2: 793-798; Tanabe-Tochikura A. et al., (1990) Virology 126:473-476), suggest a 
potential role for the gpl20r in HIV infection or pathology. 

The present invention demonstrates that the gpl20r participates in cellular binding 
of HIV by a non-CD4 pathway in muscle and brain, as well as, facilitating virus 
attachment in CD4 positive cell types. It is likely that the gpl20r plays a significant role in 
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transplacental transport of HIV (Zacher, V. et aL, (1991) J.. Virol. &:2102-2107) and 
colon infection. (Barnett, S.W. et al. (1991) Virol. 122:802-809). Gpl20 produces an 
increase in intracellular calcium in rat retinal ganglion cells (Dreyer, E.B. et aL, (1990) 
Science 24g:364-367) suggesting that the gpl20r or a homologous protein may have 
5 signaling functions in the nervous system disrupted by gpl20 leading to HIV neurotoxicity. 

In the present invention, a new non-CD4 binding protein, or receptor, for gpl20 
was isolated. The HTV surface protein gpl20 was found to bind to a receptor on human 
placental membranes that was not blocked by antibodies directed against CD4, such as 
G17-2 and OKT4a, and which interfere with gpl20 binding to CD4. A cDNA encoding 
10 this receptor was isolated from a placental cDNA library in a mammalian expression v^pr 
(pCDM8). The gene products were expressed in COS cells and were screened by I- 
labelled gpl20 binding. From a pool of 90,000 cDNA molecules, a single clone was 
isolated that encoded a protein which bound gpl20, even in the presence of concentrations 
of anti-CD4 antibody (G17-2) which completely blocked gpl20 binding to CD4. 
15 Sequence studies were carried out and indicated that the 1.5 kilobase cDNA clone 

encoded a previously unknown member of a family of Type H membrane proteins with an 
extracellular C type lectin domain. 

The cloned gpl20r of the present invention binds gpl20 with an affinity (Kd) of 
about 1 to 2 nM, which is considerably greater than the affinity of CD4 for gpl20 (about 
20 Kd = 4 nM). 

The binding of gpl20 to gpl20r is not blocked by polyclonal HIV antisera, but is 
inhibited by mannose carbohydrates, rucose carbohydrates, plant lectins such as 
concanavalin A and pradimicin A antibiotics. Other sugars such as N-acetyl-d-glucosamine 
and galactose are less potent inhibitors. 

25 The gpl20r is expressed on many mammalian cells which do not exhibit high levels 

of CD4, such as placenta, skeletal muscle, brain, and mucosal cells. Other tissue and cells 
displaying gpl20r include colon, thymus, heart, T cells, B cells and macrophages. The 
distribution of tissue having gpl20r parallels that for binding of gpl20 which is not 
blocked by CD4 antibodies, and for HIV infection which is not neutralized by soluble 

30 CD4. This observation suggests a role for gpl20r in viral infection. 

In gpl20r expressing transfected COS cells, gpl20 is rapidly mtemalized following 
binding to gpl20r. This binding and internalization of gpl20 is inhibited by compounds 
such as mannan, concanavalin A and pradimicin A. 

In the present invention a cDNA which encodes gpl20 was isolated and cloned. A 

35 DNA molecule of the present invention corresponds to a complementary DNA molecule 
which transcribes a messenger RNA (mRNA) molecule which, when translated, encodes 
gpl20r. The cDNA molecules were obtained by reverse-transcribing mRNA molecules 
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isolated from mammalian tissue such as placenta, colon, brain or thymus. The 
transcription and cloning of cDNA molecules and isolation of gene products are techniques 
well known in the art and, for example, are described in Sambrook et al., "Mplgcyily 
Cloning: A Laboratory Manual ". 2d edition, Cold Spring Harbor Lab., Cold Spring 
5 Harbor, NY (1989), which is incorporated herein by reference. 

As used herein, the phrases "physiologically tolerable" and "pharmaceutical^ 
acceptable" refer to molecular entities and compositions that do not produce an allergic or 
similar untoward reaction, such as gastric upset, dizziness and the like, when administered 
to a mammal. The physiologically tolerable carrier may take a wide variety of forms 
10 depending upon the preparation desired for administration and the intended route of 
administration. 

A carrier is a material useful for administering the active compound and must be 
"acceptable* in the sense of being compatible with the other ingredients of the composition 
and not deleterious to the recipient thereof. 
15 The pharmaceutical compositions are,prepared by any of the methods well known in 

the art of pharmacy all of which involve bringing into association the active compound and 
the carrier therefor. 

For therapeutic use, the agent utilized in the present invention can be administered 
in the form of conventional pharmaceutical compositions. Such compositions can be 
20 formulated so as to be suitable for oral or parenteral administration, or as suppositories. In 
these compositions, the agent is typically dissolved or dispersed in a physiologically 
tolerable carrier. 

As an example, the compounds of the present invention can be utilized in liquid 
compositions such as sterile suspensions or solutions, or as isotonic preparations containing 

25 suitable preservatives. Particularly well suited for the present purposes are injectable 
media constituted by aqueous injectable isotonic and sterile saline or glucose solutions. 
Additional liquid forms in which the present compounds may be incorporated for 
administration include flavored emulsions with edible oils such as cottonseed oil, sesame 
oil, coconut oil, peanut oil, and the like, as well as elixirs and similar pharmaceutical 

30 vehicles. 

The present agents can also be administered in the form of liposomes. As is known 
in the art, liposomes are generally derived from phospholipids or other lipid substances, 
liposomes are formed by mono- or multi-lamellar hydrated liquid crystals that are 
dispersed in an aqueous medium. Any non-toxic, physiologically acceptable and 
35 metabdlizable lipid capable of forming liposomes can be used. The present compositions 
in liposome form can contain, in addition to the agent of the present invention, stabilizers, 
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preservatives, expedients, and the like. The preferred lipids are the phospholipids and the 
phosphatidyl cholines (lecithins), both natural and synthetic. 

Methods to form liposomes are known in the art. See, for example, Prescott, Ed., 
"Methods in Cell Biology ". Volume XTV, Academic Press, New York, N.Y. (1976) p 33 
5 etseq. 

The present compounds can also be used in compositions such as tablets or pills, 
preferably containing a unit dose of the compound. To this end, the agent (active 
ingredient) is mixed with conventional tabletting ingredients such as corn starch, lactose, 
sucrose, sorbitol, talc, stearic acid, magnesium stearate, dicalcium phosphate, gums or 

10 similar materials as non-toxic, physiologically tolerable carriers. The tablets or pills of the 
present compositions can be laminated or otherwise compounded to provide unit dosage 
forms affording prolonged or delayed action. 

It should be understood that in addition to the aforementioned carrier ingredients the 
pharmaceutical formulation described herein can include, as appropriate, one or more 

15 additional carrier ingredients such as diluents, buffers, flavoring agents, binders, surface 
active agents, thickeners, lubricants, preservatives excluding antioxidants) and the like, 
and substances included for the purpose of rendering the formulation isotonic with the 
blood of the intended recipient. 

The tablets or pills can also be provided with an enteric layer in the form of an 

20 envelope that serves to resist disintegration in the stomach and permits the active 
ingredient to pass intact into the duodenum or to be delayed in release. A variety of 
materials can be used for such enteric layers or coatings, including polymeric acids or 
mixtures of such acids with such materials as shellac, shellac and cetyl alcohol, cellulose 
acetate, and the like. A particularly suitable enteric coating comprises a styrene-maleic 

25 acid copolymer together with known materials that contribute to the enteric properties of 
the coating. 

A method of inhibiting HTV infection of mammalian cells is disclosed in the present 
invention. A pharmaceutical composition containing a compound which effectively inhibits 
the binding of gpl20r to HTV, is contacted with cells either in vitro or in vivo for a time 
30 period sufficient to rignfficanuy inhibit the bmdmg of HIV to the cell surface. 

Compounds effective in this method include mahnose carbohydrates, fucose 
carbohydrates, plant lectins and pradimicin A antibiotics. Specifically preferred 
compounds are mannose, fucose, mannan, concanavalin A and pradimicin A. The 
pharmaceutical composition of the present invention includes a compound which effectively 
35 inhibits gpl20r binding to HIV and may also include a physiologically tolerable carrier. 

The method of the present invention is preferably utilized to inhibit HTV infection 
of placental, brain, muscle, neural and colon cells. 
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A diagnostic method is also described in the present invention for detecting the 
presence, and preferably the amount, of HIV present in a fluid sample by producing a 
reaction product containing HIV bound to gpl20r. Those skilled in the art will recognize 
that there are well known clinical diagnostic procedures that can be utilized for the 
5 formulation and detection of such reaction products. Thus, while exemplary assay methods 
are described herein, the invention is not intended to be so limited. 

Various heterogeneous and homogeneous assay protocols can be employed for 
detecting the presence, and preferably the amount, of HIV in a fluid sample. For example, 
the present invention contemplates a method for assaying a sample, such as a body fluid, 
10 for the presence of HIV comprising the steps of: 

(a) admixing a fluid sample with gp!20r, either in solution or affixed to a solid 
matrix; 

(b) maintaining the admixture for a predetermined time period such as about 10 
minutes to about 16 - 20 hours and under biological assay conditions at a 

15 temperature of about 4°C to about 45°C that is sufficient for any HIV 

present in the sample to react with (bind) the gpl20r to form a reaction 
r product; and 

(c) determining the presence of any reaction product that is formed, and thereby 
the presence of any HIV in the admixture. 

20 Preferably, the fluid sample is a body fluid sample, such as blood, plasma, serum, 

urine, saliva, semen or cerebrospinal fluid (CSF). 

The determination of the presence of a reaction product, either directly or 
indirectly, can be accomplished by assay techniques well known in the art such as by the 
use of an indicating or labelling means, as discussed hereinbelow. In a preferred 

25 embodiment, a labelled indicating means, such as a fluorescein-labelled antibody, is 
capable of binding to the gpl20r present in the reaction product to form a labelled 
complex. Determining the presence of the labelled complex provides an assay for the 
presence of HIV in the sample. In particularly preferred embodiments, the amount of 
labelled indicating means bound as part of the complex is determined, and thereby the 

30 amount of HIV present in the sample is determined. When that amount is zero, no HIV is 
present in the sample, within the limits of detection. Methods for assaying the presence 
and amount of a labelled indicating means depend on the label used, such labels and assay 
methods being well known in the art 

In a preferred embodiment, the gpl20r is affixed on a solid matrix to form a solid 

35 phase support In that embodiment, the assay is heterogeneous, solid/liquid phase assay 
and, as such, has its own preferred manipulations. For example, following admixing of a 
liquid sample with a solid support containing gpl20r affixed thereto, the admixture is 
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stained under biological assay conditions for a time period sufficten, for any 
p^t in the sample ,0 bind to 8Pl20r and form a solid phase bound ration ps«tact. 
L solid and liquid phases are men sepamted to remove any material > the sample M 
dd no. react with me solid support, ttuch as by rinsing. This removes any matenaJ present 

5 in the ample that could interfere with the detection of the reaction Product. . 

A labelled indicating means is then admixed with the sepastded sohd phase m an 
anueous medium In form a solid/liquid phase labeutag-reaction admixrine which is 
STIed for a time period sufficien. for the indicating means » hind «o ft. »hdbo«nd 
^ product forming a labelled complex. The solid phase is flten separated ftomttte 

10 liquid phL, rinsed and me presence, and preferably amount, of me Seating means 

present is determined. ^ 

As used herein, the term "biological assay conditions- refers to parameters that 

rcaintain the biological activity of the molecules and organisms in the present invent™ 

and include a temperature range of about 4"C to about 45»C, a P H value 
15 to about 9, and an ionic strength varying from that of distilled water to that of about one 

molar sodium chloride. Methods for optimizing such conditions are well known » the art 
As used herein, the term "about" refers to a range of values both greater than 

and/or less than the listed value by 10% or less. For example, a temperature of about 20 

C will include temperature values of from 18° C to 22° C. . nc 

20 As used herein, the term "corresponds", and its various grammatical modifications, 

means "is similar or in agreement with". 

A diagnostic system in kit form for assaying a fluid sample for the presence of HIV 

is also contemplated by the present invention. Such a kit includes, in an amount suffiaent 

for at least one assay, gpl20r as a packaged reagent, together with mictions for us* An 
25 indicating means capable of detecting or signalling the presence of a reaction prod** 

formed between gpl20r and HIV may also be present in the kit as a separately packaged 

"""^As used herein, the term "instructions for use" typically includes a tangible 
expression describing the reagent concentration or at least one assay method parameter 
30 such as the relative amounts of xeagent and sample to be admixed, mamtenance time 
periods for admixtures, temperature, buffer conditions and the lake. 

The packaging materials discussed herein in relation to diagnostic systems are those 
customarily utilized. Such materials include glass and plastic (e.g. polyethylene, 
polypropylene and polycarbonate) bottles, vials, plastic and plastic-foil laminated envelopes 

35 and the like. . , 

As used herein, the term "package" refers to a solid material such as glass, plastic, 

paper, foil and the like capable of holding within fixed limits the gpl20r, and preferably 
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also a detection means. In one embodiment, the package can contain a microliter plate 
well to which micrognun quantities of gpl20r have been opeiatively affixed, ie., linked so 
as to be capable of reacting with and bind HIV and/or gpl20. 

As used herein, the terms "label" "indicating means? and "labelled indicating 

5 means", in their various grammatical forms refer to single atoms and molecules that are 
either directly or indirectly involved in the production of a detectable signal to indicate or 
detect the presence of a reaction product. Such labels are themselves well known in 
clinical diagnostic chemistry and constitute a part of this invention only insofar as they are 
utilized with otherwise novel methods and/or systems. 

10 The indicating means can be a fluorescent labelling agent that chemically binds to 

antibodies or protein antigens without denaturing them to form a fluorochrome (dye) that is 
a useful immunofluorescent tracer. Suitable fluorescent labelling agents are fluorochrome, 
such as fluorescein isocyanate (FIC), fluorescein isothiocyanate (FTTC), 5-dimethylamine- 
1-naphthalene sulfonyl chloride (DANSC), tetramewykhodantine isocyanate (TRITC), 

15 tissarnine and the like. Immunofluorescence analysis techniques are well known in the art, 
and for example, is described in DeLuca, "Immunofluorescence Analysis" in 
Tmmimofluorescence Analysis . Marchalonis et al., (1982) eds., John Wiley & Sons, Ltd., 
pp. 189-231, which is incorporated herein by reference. 

Other preferred indicating means are colorimetric agents and enzymes, such as 
= 20 horseradish peroxidase, glucose oxidase or the like, linked as described above, as well as 
radioactive elements, preferably an element that produces gamma ray emissions. Elements 
which emit gamma rays,.such as 124 I, 125 I, 128 I, 132 I, and 51 Cr represent one class of 
radioactive indicating groups. Another group of useful labelling means are those elements 
such as 11 C, 18 F, ^O and 13 N which emit positrons. The positrons so emitted produce 

25 gamma rays upon interaction with electrons present. 

Having generally described this invention, a further understanding can be obtained 
by reference to certain specific examples which are provided herein for purposes of 
illustration only and are not intended to be limiting unless otherwise specified. 

30" . EXAMPLE 1 

f?loninp and Isolation of Non -CT>4 Gp140 Receptor Protein 

Human placental membranes were found to be able to bind vaccinia derived 
recombinant gpl20 (vgpl20) with a Kd of 1.3 nM. At nM (concentrations) of gpl20 none 
35 of this binding was inhibited by an antibody (G17-2) which has been reported to efficiently 
block gp!20 binding to CD4 (Linsley et al. (1988) J. Virol. &:3695-3702), as shown in 
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FIGURE 2A. Approximately 50 - 90% of the total placental gpl20 binding was not due to 
CD4. 

A placental cDNA Hbiary was obtained in the mammalian expression vector 
pCDM8 and was screened. A cDNA was isolated which expressed protein that exhibited 
5 high affinity binding for vgpl20 in the presence of G17-2. 

This protein, designated as gpl20 receptor (gpl20r), also bound native gpl20 
(ngpl20), and the binding component was precipitated in the presence of an antibody 
directed against gpl20. 

10 EXAMPLE 2 

Characterization 

The binding of radiolabelled gpl20 to gpl20r expressed in COS-7 cells was 
studied. Pools of 90,000 cDNA molecules, obtained from a placental pCDM8 library, 

15 were transfected by electroporation into COS-7 cells. Cells which expressed gpl20r onfce 
surface was identified by screening with either 1 nM of 125 I-labelled vgpl20 ( I- 
vgpl20) or ^I-ngp^ by the method described in Kozlowski et al., (1990) Antivir. 
Chem. Chemother. 1:175-182, incorporated herein by reference. The results of binding 
studies utilizing the transfected COS-7 cells are shown in FIGURE 1. 

20 Binding of labelled gpl20 (1 nM) to the cells was carried out following a 1 hour 

preincubation of the cells or GP120 at 22°C with one or more of the following: anti-CD4 
antibody G17-2 (5 ug/ml), baculovirus-derived gpl20 (bgpl20, American Biotechnologies, 
200 nM), anti-gpl20 monoclonal antibody 110.1 (25 jig/ml), D-mannose (100 mM), D- 
galactose a00 mM),>fucose (100 mM), concanavalin A (1 mg/ml) or pradimicin A (100 

25 ug/ml). The cells were monitored after autoradiography (3 days). The results seen in 
FIGURES 1 (A and B) illustrate that gpl20 binding to the gpl20r expressed on the cells 
was blocked by excess bgpl20, mannose, fucose, pradimicin A, Concanavalin A, and 
preincubation with antibody 110.1 but not by CD4, antibody G17-2, galactose, or HIV 
antisera. Studies were also carried out on gpl20 binding to CD4 expressing COS cells, 

30 transfected with it H3MCD4 by the method of Peterson et aL (1988) Cell 51:65-72. 

Control studies of the binding of 125 I-labelled psoralen-UV inactivated HTV-BRU 
to the gpl20r expressing COS-7 cells demonstrated binding of HTV to gpl20r and blockage 
by excess bgpl20 (FIGURE 1Q. A. tabular compilation quantitating the amount of bound 
material to the cells in FIGURE 1 is shown in Table. 1 
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30 Scatchard plots of gpl20 binding to placental membranes and to COS cells 

expressing the gpl20r were carried out in the presence and absence of a 200 fold excess of 
bgpl20 or ngpl20. The results, shown in FIGURE 2A, demonstrate a specific binding of 
vgpl20 to gpl20r with a Kd of 1.7 nM ± 0.4 (n=4) and of ngpl20 to gpl20r with Kd of 
1.8 nM ± 0.2 (n=4), with 150,000 and 149,000 receptors per cell, respectively. 

35 Concurrent analysis of gpl20 binding to CD4 expressed on COS cells gave a Kd of 4-5 nM 
in agreement with previous reports (Linsley, P.S. et al. (1988) J. Virol £2:3695-3702; 
Schmttman, et al. (1988) J. Immunol. 141:4181-4186). Calculations from the association 
and dissociation rate constants gave a similar comparative result. The expressed gpl20r 
has a relative molecular mass (Mr) of "48,500 and a protein of similar size was also 

40 partially purified from placental membranes (FIGURE 2D). 

The placental membranes and COS cells were surface iodinated, and treated with 1 
nM unlabelled vgpl20, then washed with Blotto RPMI, 5% BSA, 1% Non-fat dry milk, 
0.2% sodium azide solubilized in Triton X-100 (1% in PBS with a protein inhibitor 
cocktail, PMSF, Pepstatin A, orthophenathroline and leupeptin) and immunoprecipitated 



BNSOOCtO: <WO 9301S20A2_L> 



-14- 



10 



15 



with HIV or control human sera, according to the method described in Curtis et al. (1990) 
J. Immunol. 144:1295-1303. . 

Northern analysis of the expression of the gpl20r RNA indicated a major species of 
-5 kb and a minor species of '1.7 kb which may represent an alternatively processed 
transcript and is more consistent with the size of the gpl20r cDNA. RNA was denatured, 
separated in an agarose gel, transferred to nitrocellulose, hybridized to gpl20r cDNA and 

autoradiographed for 3 days. 

Expression of gpl20r RNA was highest in colon followed by thymus, placenta, 
heart, skeletal muscle, and was not detected in liver or kidney. Low levels of expression 
in brain, T cell, B cell, and macrophage (FIGURE 2E) require verification by polymerase 
chain reaction (PCR). Full length CD4 RNA was highest in thymus, T cell, and 
macrophage followed by placenta and colon (not shown). 

The gpl20r cDNA encodes a protein of 404 amino acids with a calculated Mr of 

45,775 (FIGURE 3A). 

Sequencing of both strands of gpl20r cDNA was carried out by the dideoxy chain 
termination method. The nucleotide sequence proceeding the first ATG agrees with the 
Kozak consensus. The predicted cytoplasmic domain has a similar length and shows some 
sequence homology to other type H membrane protein C-type lectins (Spiess, M. (1990) 
Biochemistry 22:10009-10018). The membrane spanning sequence is underlined and was 
predicted in part by homology to related sequences in FIGURE 3C. The potential N- 
linked glycosylation site is marked by an- asterisk. The start of the seven complete and 
eighth partial tandem repeats are indicated (R1-R8). The consensus repeat sequence is 
IYQELT(R/Q) LKAAVGELPEKSKLQE. The beginning of the lectin domains is also 
indicated (L). No signal sequence was apparent but instead demonstrated homology to a 
25 family of Type H membrane proteins which utilize a '20 residue hydrophobic stop-transfer 
sequence for membrane translocation. The "positive inside rule" (von Heijne, G. et al. 
(1988) Eur. J. Biochem. 124:671-678) for the sequence within fifteen residues of the 
transmembrane region predicts a cytoplasmic amino terminus in agreement with the 
homology to membrane associated C-type lectins with similar membrane orientation 
30 (FIGURE 3Q (Spiess, M. (1990) Biochemistry 21:10009-10018). This region, Met 1 to 
Ala 76, represents the first domain of the gpl20r sequence. 

The second domain (He 77 to Val 249) consists of tandem repeats of nearly 
identical sequence (FIGURE 3A). This region was predicted to consist of a series of 
amphipathic a-helices interrupted by B-turns. Circular Dichroism spectra in 40% 
35 trifluoroethanol of a consensus repeat peptide beginning with the B-turn, 
PEXSKLQEIYQELTQLKAAVGEL (single-letter amino-acid code), demonstrated an all a- 
helical structure (not shown). Homology to other repeat domains suggested three possible 
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tertiary structures, (1) antiparallel helix bundles, (2) a multimeric parallel helix bundle, and 
(3) a membrane pore with a hydrophobic exterior and a negatively charged interior. The 
first two models would function as spacers to separate the lectin domain from the 
membrane, while the third could generate a transmembrane signal after ligand binding. 
5 The third domain (Cys 253 to Ala 404) is homologous to the other known C-type 

lectins which are type II membrane proteins (FIGURE 3C). With the exception of the 
IgEr, these lectins bind terminal D-galactose and D-N-acetylgalactosamine of glycoproteins 
(Spiess, M. (1990) Biochemistry 2<>: 10009-10018). 

The most closely related sequences were the group of Type II membrane protein Cl- 
io type lectins: Chick hepatic lectin (CHL) (Drickamer, KJ. (1981) Biol. Chem. 25£:5827- 
5839), low affinity IgE receptor (IgEr) (Kikutani, H. et al. (1986) Cell 47: 657-665), the 
asialoglycoptorein receptors (human HI and H2 (Spiess, M. et al. (1985) Proc. Natl. 
Acad. Sci. USA £2:6465-6569) are shown), and the rat Kupffer cell receptor (Hoyle, 
G.W. et al. (1988) J. Biol. Chem. 263:7487-7492). The most similar mannose binding 
15 lectin was one of the eight carbohydrate recognition domains of the human macrophage 
mannose receptor (Mannr) (Taylor, M.E. et al. (1990) J. Biol. Chem. 2£5: 12156-12162; 

5 Ezekowitz, R.A.B. et al. (1990) J. Exp. Med. 172:1785-1794). Residues identical tfc tec 
...» gpl20r are boxed. ALIGN scores indicate significant sequence similarity if greats thaai ^ - 

6 3.0. The complete gpl20r sequence was most homologous to the Kupffer cell receptor \ 
20 which has a similar tandem repeat (Hoyle, G.W. et al. (1988) J. BioL Chem. 262:7487- 

7492). 

The inability to crosslink gpl20 to the non-CD4 sites on placenta and brain cell 
lines (not shown) was consistent with an interaction of the gpl20r with carbohydrate, and 
polyclonal HIV antisera added to gpl20 blocked binding to CD4 but not to the gpl20r 

25 (FIGURE IB). Galactose and N-acetylgalactosamine did not block gpl20 binding, but 
mannose and fucose completely blocked binding to the gpl20r without an effect on CD4 
(FIGURE IB). Inhibition by a series of sugars is shown in FIGURE 2B. Human IgE (10 
/xg/ml), sialic acid (100 mM), and mannose-6-phosphate (100 mM) had no effect on 
binding to the gpl20r. The three forms of gpl20 used have different oligosaccaride 

30 structured. Bgpl20 contains only high mannose structures (Hsieh, P. et al. (1984) J. Biol. 
Chem. 259:2375-2382). Vgpl20 has equal proportions of high mannose and complex 
(Mizuochi, T. et al. (1988) Biochem. J. 254:599-603) similar to ngpl20 which has a 
greater structural diversity in the complex chains (Geyer, H. et al. (1988) J. BioL Chem. 
261:11760-11767; Mizuochi, T. et al. (1990) J. Biol. Chem. 2£5l85 19-8524). The 

35 affinity of the gpl20r for all three forms was similar (FIGURE 2A ) suggesting that the 
terminal mannose of high mannose chains are the primary determinants of binding. As 
expected for a C-type lectin the gpl20r required calcium and binding was blocked by 
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EGTA (FIGURE 2B). Hie gpl20r carbohydrate specificity is more closely related to 
serum mannose -binding proteins and to the Mr 175,000 mannose-specific endocytosis 
receptor found in macrophages and placenta (Taylor, ME. et al. (1990) J. Biol. Chem. 
2£:12156-12162; Ezekowitz, R.A.B. et al. (1990) J. Exp. Med. 172:1785-1794) 
5 (FIGURE 3Q. Low (1 nM) concentrations of gpl20 did not purify a Mr 175,000 band 
from placental membranes (FIGURE 2D) consistent with a reported concentration of 150- 
300 nM for gpl20 saturation of the macrophage receptor (Larkin, M. et al. (1989) AIDS 
3,793-798). 

The importance of gpl20 carbohydrate in HIV infection has been suggested by the 
10 ability of plant lectins (lifson, J. et al. (1986) E. J. Exp. Med. 1*4:2101-2106) and serum 
mannose-binding protein (Ezekowitz, R.A.B. et al. (1989) J. Exp. Med. m 185-196) to 
block infection, and a proposed role for the macrophage endocytosis receptor in viral 
attachment (Larking M. et al. (1989) AIDS 3, 793-798). Concanavalin A treatment of 
gpl20 blocked binding to the gpl20r and CD4 (FIGURE IB), consistent with a stenc 
15 hindrance of receptor interaction. The antibiotic pradimicin A blocks BUY infection of 
CD4 positive T cells and this inhibitory effect is prevented by mannan and EGTA (Tanabe- 
Tochikura. A. et al. (1990) Virology 17£:476-473). Pradamicin blocked gpl20 binding to 
the gpl20r and CD4, while mannan and EGTA only inhibited binding to the gpl20r 
(FIGURE2B). Mannan fohfoited'W^ 
20 macrophages, consistent with gpl20r expression (FIGURE 2E), suggesting that in addition 
to CD4 the gpl20r may be important for HTV binding and infection. The observation the 
the gpl20r rapidly internalized its bound ligand gpl20 (FIGURE 2Q, and also binds 
radiolabelled HIV in a gpl20 dependent fashion (FIGURE 1Q also support this 
conclusion. 

25 The foregoing description and Examples are intended as illustrative of the present 

invention, but not as limiting. Numerous variations and modifications may be effected 
without departing from the true spirit and scope of the present invention. 
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(2) 
© 



INFORMATION FOR SEQ ID NO:l: 
SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 

(C) STRANDEDNESS: 

(D) TOPOLOGY: 



1312 base pairs 
nucleic acid 
double 
linear 



(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human immunodeficiency virus type 1 

fix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 42..1253 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CTAAAGCAGG AGTTCTGGAC ACTGGGGGAG AGTGGGGTGA C ATG AGT GAC TCC 

Met Ser Asp ser 
1 

AAG GAA CCA AGA CTG CAG CAG CTG GGC CTC CTG GAG GAG GAA CAG CTG 
Lys Glu Pro Arg Leu Gin Gin Leu Gly Leu Leu Glu Glu Glu Gin Leu 
5 10 15 20. 

AGA GGC CTT GGA TTC CGA CAG ACT CCA GGA TAC AAG AGC TTA GCA GGG 
Arg Gly Leu Gly Phe Arg Gin Thr Arg Gly Tyr Lys Ser Leu Ala Gly 
25 3° 35 

TGT CTT GGC CAT GGT CCC CTG GTG CTG CAA CTC CTC TCC TTC ACG CTC 
Cvs Leu Gly His Gly Pro Leu Val Leu Gin Leu Leu Ser Phe Thr Leu 
40 45 50 

TTG GCT GGG CTC CTT GTC CAA GTG TCC AAG GTC CCC AGC TCC ATA AGT 
Leu Ala Gly Leu Leu Val Gin Val Ser Lys Val Pro Ser Ser He Ser 
55 60 65 

CAG GAA CAA TCC AGG CAA GAC GCG ATC TAC CAG AAC CTG ACC CAG CTT 
Gin Glu Gin Ser Arg Gin Asp Ala He Tyr Gin Asn Leu Thr Gin Leu 
70 75 80 

AAA GCT GCA GTG GGT GAG CTC TCA GAG AAA TCC AAG CTG CAG GAG ATC 
Lvs Ala Ala Val Gly Glu Leu Ser Glu Lys Ser Lys Leu Gin Glu He 
85 90 95 100 



53 



101 



149 



197 



245 



293 



341 
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TAC CAG GAG CTG ACC CAG CTG AAG GCT GCA GTG GGT GAG CTT CCA GAG 389 
Tyr Gin Glu Leu Thr Gin Leu Lys Ala Ala Val Gly Glu Leu Pro Glu 
105 HO 115 

AAA TCT AAG CTG CAG GAG ATC TAC CAG GAG CTG ACC CGG CTG AAG GCT 437 
Lys Ser Lys Leu Gin Glu He Tyr Gin Glu Leu Thr Arg Leu Lys Ala 
120 125 130 

GCA GTG GGT GAG CTT CCA GAG AAA TCT AAG CTG CAG GAG ATC TAC CAG 485 
Ala Val Gly Glu Leu Pro Glu Lye Ser Lys Leu Gin Glu He Tyr Gin 
135 140 145 

GAG CTG ACC TGG CTG AAG GCT GCA GTG GGT GAG CTT CCA GAG AAA TCT 533 
Glu Leu Thr Trp Leu Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser 
150 155 160 

AAG ATG CAG GAG ATC TAC CAG GAG CTG ACT CGG CTG AAG GCT GCA GTG 581 
Lys Met Gin Glu He Tyr Gin Glu Leu Thr Arg Leu Lys Ala Ala Val 
165 170 175 180 

GGT GAG CTT CCA GAG AAA TCT AAG CAG CAG GAG ATC TAC CAG GAG CTG 629 
Gly Glu Leu Pro Glu Lys Ser Lys Gin Gin Glu He Tyr Gin Glu Leu 
185 190 195 

ACC CGG CTG AAG GCT GCA GTG GGT GAG CTT CCA GAG AAA TCT AAG CAG 677 
Thr Arg Leu Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Gin 
200 205 210 

CAG GAG ATC TAC CAG GAG CTG ACC CGG CTG AAG GCT GCA GTG GGT GAG 725 
Gin Glu He Tyr Gin Glu Leu Thr Arg Leu Lys Ala Ala Val Gly Glu 
215 220 225 

CTT CCA GAG AAA TCT AAG CAG CAG GAG ATC TAC CAG GAG CTG ACC CAG 773 
Leu Pro Glu Lys Ser Lys Gin Gin Glu He Tyr Gin Glu Leu Thr Gin 
230 235 240 

CTG AAG GCT GCA GTG GAA CGC CTG TGC CAC CCC TGT CCC TGG GAA TGG 821 
Leu Lys Ala Ala Val Glu Arg Leu Cys His Pro Cys Pro Trp Glu Trp 
245 ~ 250 255 260 

ACA TTC TTC CAA GGA AAC TGT TAC TTC ATG TCT AAC TCC CAG CGG AAC 869 
Thr Phe Phe Gin Gly Asn Cys Tyr Phe Met Ser Asn Ser Gin Arg Asn 
265 270 275 

TGG CAC GAC TCC ATC ACC GCC TGC AAA GAA GTG GGG GCC CAG CTC GTC 917 
Trp His Asp Ser He Thr Ala Cys Lys Glu Val Gly Ala Gin Leu Val 
280 285 290 

GTA ATC AAA AGT GCT GAG GAG CAG AAC TTC CTA CAG CTG CAG TCT TCC 965 
Val lie Lys Ser Ala Glu Glu Gin Asn Phe Leu Gin Leu Gin Ser Ser 
295 300 305 
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AGA AGT AAC CGC TTC ACC TGG ATG GGA CTT TCA GAT CTA AAT CAG GAA 1013 
Arg Ser Asn Arg Phe Thr Trp Met Gly Leu Ser Asp. Leu Asn Gin Glu 
310 315 320 

GGC ACG TGG CAA TGG GTG GAC GGC TCA CCT CTG TTG CCC AGC TTC AAG 1061 
Glv Thr Trp Gin Trp Val Asp Gly Ser Pro Leu Leu Pro ser Phe Lys 
325 330 335- 340 

CAG TAT TGG AAC AGA GGA GAG CCC AAC AAC GTT GGG GAG GAA GAC TGC 1109 
Gin Tyr Trp Asn Arg Gly Glu Pro Asn Asn Val Gly Glu Glu Asp Cys 
345 350 355 

GCG GAA TTT AGT GGC AAT GGC TGG AAC GAC GAC AAA TGT AAT CTT GCC 1157 
Ala Glu Phe Ser Gly Asn Gly Trp Asn Asp Asp Lys Cys Asn Leu Ala 
350 365 370 

AAA TTC TGG ATC TGC AAA AAG TCC GCA GCC TCC TGC TCC AGG GAT GAA 1205 
Lys Phe Trp He Cys Lys Lys Ser Ala Ala Ser Cys ser Arg Asp Glu 
375 380 385 

GAA CAG TTT CTT TCT CCA GCC CCT GCC ACC CCA AAC CCC CCT CCT GCG 1253 
Glu Gin Phe Leu Ser Pro Ala Pro Ala Thr Pro Asn Pro Pro Pro Ala 
390 395 400 

TAGCAGAACT TCACCCCCTT TTAAGCTACA GTTCCTTCTC TCCATCCTTC GACCTTTAG 1312 

(2) INFORMATION FOR SEQ ID NO: 2: 
Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 amino acids bC-S^W 

(B) TYPE: amino acid ^ ^| (^Cj^^x <ri!~4 
(D) TOPOLOGY: linear 7 

(u) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQIDNO:2: 

Met Ser Asp Ser Lys Glu Pro Arg Leu Gin Gin Leu Gly Leu Leu Glu 
1 5 10 15 

Glu Glu Gin Leu Arg Gly Leu Gly Phe Arg Gin Thr Arg Gly Tyr Lys 
20 25 30 

Ser Leu Ala Gly Cys Leu Gly His Gly Pro Leu Val Leu Gin Leu Leu 
35 40 45 

Ser Phe . Thr Leu Leu Ala Gly Leu Leu Val Gin Val Ser Lys Val Pro 
SO 55 60 
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Ser Ser lie Ser Gin Glu Gin Ser Arg Gin Asp Ala lie Tyr Gin Asn 
65 70 75 80 

Leu Thr Gin Leu Lys Ala Ala Val Gly Glu Leu Ser Glu Lys Ser Lys 
85 90 . 95 

Leu Gin Glu He Tyr Gin Glu Leu Thr Gin Leu Lys Ala Ala Val Gly 
100 ' 105 110 

Glu Leu Pro Glu Lys Ser Lys Leu Gin Glu He Tyr Gin Glu Leu Thr 
115 120 125 

Arg Leu Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Leu Gin 
130 135 140 

Glu He Tyr Gin Glu Leu Thr Trp Leu Lys Ala Ala Val Gly Glu Leu 
145 150 155 160 

Pro Glu Lys Ser Lys Met Gin Glu lie Tyr Gin Glu Leu Thr Arg Leu 
165 170 175 

Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Gin Gin Glu He 
180 185 190 

Tyr Gin Glu Leu Thr Arg Leu Lys Ala Ala Val Gly Glu Leu Pro Glu 
195 200 205 

Lys Ser Lys Gin Gin Glu He Tyr Gin Glu Leu Thr Arg Leu Lys Ala 
210 215 220 

Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Gin Gin Glu He Tyr Gin 
225 ^ 230 235 240 

Glu Leu Thr Gin Leu Lys Ala Ala Val Glu Arg Leu Cys His Pro Cys 
245 250 255 

Pro Trp Glu Trp Thr Phe Phe Gin Gly Ash Cys Tyr Phe Met Ser Asn 
260 265 270 

Ser Gin Arg Asn Trp His. Asp Ser He Thr Ala Cys Lys Glu Val Gly 
275 280 285 

Ala Gin Leu Val Val lie Lys Ser Ala Glu Glu Gin Asn Phe Leu Gin 
290 295 300 

Leu Gin Ser Ser Arg Ser Asn Arg Phe Thr Trp Met Gly Leu Ser Asp 
305 310 315 320 

Leu Asn Gin Glu Gly Thr Trp Gin Trp Val Asp Gly Ser Pro Leu Leu 
325 330 335 

Pro Ser Phe Lys Gin Tyr Trp Asn Arg ciy Glu Pro Asn Asn Val Gly 
340 345 350 
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Glu Glu Asp Cys Ala Glu Phe Ser Gly Asn .Gly Trp Asn Asp Asp Lys 
355 360 365 

lie Cys Lys Lys Ser Ala Ala Ser Cys 



Cys Asn Leu Ala Lys Phe Trp 
370 375 



380 



Ser Arg Asp Glu Glu Gin Phe Leu Ser Pro Ala Pro Ala Thr Pro Asn 



385 

Pro Pro Pro Ala 



390 



395 



(2) INFORMATION FOR SEQ ID NO: 3: 
fi) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 

(D) TOPOLOGY: 

fii) MOLECULE TYPE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: 



127 amino acids 
amino acid 
linear 

protein 

internal 



Human immunodeficiency virus type 1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Cys His Pro Cys Pro Trp Glu Trp Thr Phe Phe Gin Gly Asn Cys Tyr 



10 



Phe Met Ser Asn Ser . Gin Arg Asn Trp His Asp Ser He Thr Ala Cye 
20 25 30 

Lys Glu Val Gly Ala Gin Leu Val Val He Lys Ser Ala Glu Glu Gin 
7 35 40 45 

Asn Phe Leu Gin Leu Gin Ser Ser Arg Ser Asn Arg Phe Thr Trp Met 
SO 55 60 

Gly Leu Ser Asp Leu Asn Gin Glu Gly Thr Trp Gin Trp Val Asp Gly 



65 70 75 



Ser Pro Leu Leu Pro Ser Phe Lys Gin Tyr Trp Asn Arg Gly Glu Pro 
85 90 95 
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Asn Asn Val Gly Glu Glu Asp Cys Ala Glu Phe Ser Gly Asn Gly Trp 
100 105 HO 

Asn Asp Asp Lys Cys Asn Leu Ala Lys Phe Trp lie Cys Lys Lys 
115 120 . 125 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Cys Gly Ala Gin Ser Arg Gin Trp Glu Tyr Phe Glu Gly Arg Cys Tyr 
1 5 10 I 5 

Tyr Phe Ser Leu Ser Arg Met Ser Trp His Lys Ala Lys Ala Glu Cys 
20 25 30 

Glu Glu Met His Ser His Leu He He He Asp Ser Tyr Ala Lys Gin 
35 40 45 

Asn Phe Val Met Phe Arg Thr Arg Asn Glu Arg Phe Trp He Gly Leu 
50 55 60 

Thr Asp Glu Asn Gin Glu Gly Glu Trp Gin Trp Val Asp Gly Thr Asp 
65 70 75 80 

Thr Arg Ser Ser Phe Thr Phe Trp Lys Glu Gly Glu Pro Asn Asn Arg 
85 90 95 

Gly Phe Asn Glu Asp Cys Ala His Val Trp Thr Ser Gly Gin Trp Asn 
100 105 HO 

Asp Val Tyr Cys Thr Tyr Glu Cys Tyr Tyr Val Cys Glu Lys 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 5: 

SUBSTITUTE SHEET 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQIDNO: 5: 



Cys Asn Thr Cys Pro Glu Lys Trp lie Asn Phe Gin Arg Lys Cys Tyr 
1 5 10 .15 

Tyr Phe Gly Lys Gly Thr Lys Gin Trp Val His Ala Arg Tyr Ala Cys 
20 25 30 

Asp Asp Met Glu Gly Gin Leu Val Ser He His Ser Pro Glu Glu Gin 
35 40 45 

Asp Phe Leu Thr Lys His Ala Ser His Thr Gly Ser Trp He Gly Leu 
50 55 60 

Arg Asn Leu Asp Leu Lys Gly Glu Phe He Trp Val Asp Gly ser His 
65 " 70 75 80 

Val Asp Tyr Ser Asn Trp Ala Pro Gly Glu Pro Thr ser Arg Ser Gin 
B5 90 95 

Gly Glu Asp Cys Val Met Met Arg Gly Ser Gly Arg Trp Asn Asp Ala 
100 105 110 

Phe Cys Asp Arg Lys Leu Gly Ala Trp Val Cys Asp Arg 
115 120 125 



(2) INFORMATION FOR SEQIDNO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQIDNO: 6: 



Arg Thr Cys Cys Pro Val Asn Trp Val Glu His Glu Arg Ser Cys Tyr 
1 5 10 15 

Trp Phe Ser Arg Ser Gly Lys Ala Trp Ala Asp Ala Asp Asn Tyr Cys 
20 25 30 

Arg Leu Glu Asp Ala His Leu Val Val Val Thr Ser Trp Glu Glu Gin 
35 40 45 

Lys Phe Val Gin His His lie Gly Pro Val Asn Thr Trp Met Gly Leu 
50 55 60 

His Asp Gin Asn Gly Pro Trp Lys Trp Val Asp Gly Thr Asp Tyr Glu 
65 70 75 80 

Thr Gly Phe Lys Asn Trp Arg Pro Glu Gin Pro Asp Asp Trp Tyr Gly 
85 90 95 

His Gly Leu Gly Gly Gly Glu Asp Cys Ala His Phe Thr Asp Asp Gly 
100 105 110 

Arg Trp Asn Asp Asp Val Cys Gin Arg Pro Tyr Arg Trp Val Cys Glu 
115 120 125 

Thr 



(2) INFORMATION FOR SEQ ID NO: 7: 
Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: interna] 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Arg Thr Cys Cys Pro Val Asn Trp Val Glu His Gin Gly Ser Cys Tyr 
- x 5 10 - 15 

Trp Phe Ser His Ser Gly Lys Ala Trp Ala Glu Ala Glu Lys Tyr Cys 
20 25 30 

Gin Leu Glu Asn Ala His Leu Val Val He Asn Ser Trp Glu Glu Gin 
35 40 45 

Lys Phe He Val Gin His Thr Asn Pro Phe Asn Thr Trp He Gly Leu 
50 55 60 

Thr Asp Ser Asp Gly Ser Trp Lys Trp Val Asp Gly Thr Asp Tyr Arg 
65 70 75 80 

His Asn Tyr Lys Asn Trp Ala Val Thr Gin Pro Asp Asn Trp His Gly 
85 90 95 

HiB Glu Leu Gly Gly Ser Glu Asp Cys Val Glu Val Gin Pro Asp Gly 
100 105 HO 

Arg Trp Asn Asp Asp Phe Cys Leu Gin Val Tyr Arg Trp Val Cys Glu 
115 120 125 

Lys 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Leu Gin Leu lie Met Gin Asp Trp Lys Tyr Phe Asn Gly Lys Phe Tyr 
1 5 10 .15 
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Tyr Phe Ser Arg Asp Lys Lys Ser Trp His Glu Ala Glu Asn Phe Cys 
20 25 30 

Val Ser Gin Gly Ala His Leu Ala Ser Val Thr ser Gin Glu Glu Gin 
35 40 45 

Ala Phe Leu Val Gin He Thr Asn Ala Val Asp His Trp He Gly Leu 
50 55 60 

Thr Asp Gin Gly Thr Glu Gly Asn Trp Arg Trp Val Asp Gly Thr Pro 
65 ' 70 75 80 

Phe Asp Tyr Val Gin Ser Arg Arg Phe Trp Arg Lys Gly Gin Pro Asp 
85 90 95 

Asn Trp Arg His Gly Asn Gly Glu Arg Glu Asp Cys Val His Leu Gin 
100 105 HO 

Arg Met Trp Asn Asp Met Ala Cys Gly Thr Ala Tyr Asn Trp Val Cys 
115 120 125 



Lys Lys 
130 



(2) INFORMATION FOR SEQ ID NO: 9: 
0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino, acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Pro Thr His Cys Pro Ser Gin Trp Trp Pro Tyr Ala Gly His Cys Tyr 
1 5 1Q 15 

Lys He His Arg Asp Glu Lys Lys He Gin Arg Asp Ala Leu Thr Thr 
20 25 30. 

Cys Arg Lys Glu Gly Gly Asp Leu Thr Ser He His Thr He Glu Glu 
35. 40 45. 
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I.eu Asp Phe lie lie Ser Gin Leu Gly Leu Glu Pro Asn Asp Glu Leu 

* 50 55 
Trp lie Gly Leu Asn Asp lie Lys lie Gin Met Tyr Phe Glu Trp Ser 



65 /u 

Gly Thr Pro Val Thr Phe Thr Lys Trp Leu Arg Gly Glu Pro Ser 
85 90 

His Glu Asn Asn Arg Gin Glu Asp Cys Val Val Met Lys Gly Lys Asp 
100 105 

Gly Tyr Trp Ala Asp Arg Gly Cys Glu Trp Pro Leu Gly Tyr He Cys 
115 120 

Lys Met 
130 
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We claim : 

1. A method of inhibiting HIV infection of mammalian cells comprising contacting the 
cells with an effective amount of a compound selected from the group consisting of 
a mannose carbohydrate, a fucose carbohydrate, a lectin and a drug, for a time 

... period sufficient to significantly inhibit the binding of HIV to a non-CD4 cell 
surface protein. 

2. The method of Claim 1, wherein the non-CD4 cell surface protein is a gp!20 
receptor having a specific binding affinity for gpl20 of about Kd = 1.3 nM to 
about Kd = 2.0 nM. 

3. The method of Claim 2, wherein the gpl20 receptor is present on placental cells. 

4. The method of Claim 2, wherein the gpl20 receptor is present on muscle cells. 

5. The method of Claim 2, wherein the gpi20 receptor is present on neural cells. 

6. The method of Claim 5, wherein the neural cells are brain cells. 

7. The method of Claim 5, wherein the neural cells are dendritic cells. 

8. The method of Claim 2, wherein the gpl20 receptor is present on mucosal cells. 
10. The method of Claim 1, wherein the compound is mannose. 

10. The method of Claim 1, wherein the compound is fucose, 

11. The method of Claim 1, wherein the compound is a mannose-containing 
carbohydrate. 

12. The method of Claim 11, where the carbohydrate is mannan. 

13. The method of Claim 1, wherein the compound is a pradimicin A antibiotic. 

14. A substantially purified non-CD4 gpl20 receptor protein comprising a protein 
substantially corresponding to a non-CD4 mammalian cell surface protein that has a 
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specific binding affinity for gpl20, said protein containing about 400 amino acid 
residues, having a molecular weight of about 45,000 daltons and having a binding 
affinity for gpl20 characterized by a Kd of about 1.3 nM to about 2 nM. 

15. The gpl20 receptor protein of Claim 14, wherein the binding of the gpl20 receptor 
protein to gpl20 is inhibited by a compound selected from the group consisting of a 
mannose carbohydrate, a fucose carbohydrate, a lectin and a drug. 

16. The gpl20 receptor of Claim 15, wherein the compound is mannose. 

17. The gpl20 receptor protein of Claim 15, wherein the compound is a pradimicin A 
antibiotic. 

18. The gpl20 receptor protein of Claim 14, wherein the protein is produced by 
recombinant means. 

19. The gpl20 receptor protein of Claim 18, wherein said recombinant means 
comprises the cloning of a cDNA isolated from a library of recombinant placental 
genes. 

20. A DNA molecule encoding the gpl20 receptor protein of Claim 14, wherein the 
DNA is a complementary DNA that transcribes an mRNA found in cells selected 
from the group consisting of placental cells, brain cells, muscle cells and colon 
cells. 

21 . A method of detecting die presence of HTV in a sample comprising: 

(a) admixing in an aqueous medium a sample to be assayed with a non- 
CD4 gpl20 receptor protein having a specific binding affinity for gpl20 
characterized by a Kd of about 1.3 nM to about 2.0 nM in an amount 
sufficient to carry out at least one assay; 

(b) maintaining the admixture for a time period sufficient for the gpl20 
receptor protein to bind to any HTV present in the sample and form a 
reaction product; and 

(c) determining the presence of the HTV containing reaction product. 
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22. The method of Claim 21, wherein the gpl20 receptor protein contains about 
400 amino acid residues and has a molecular weight of about 45,000 
daltons. 

23. The method of Claim 21, wherein the gpl20 receptor protein is affixed to a 
solid matrix to form a solid support. 

24. The method of Claim 21, wherein the presence of the reaction product is 
determined by contacting the sample with a reagent capable of detecting the 
bound gpl20 receptor protein. 

25. The method of Claim 24, wherein the reagent is a labelled antibody directed 
against the gpl20 receptor protein. 

26. A diagnostic system in kit form, for assaying for the presence of HIV in a 
fluid sample, comprising a package containing a non-CD4 receptor protein 
having a specific affinity for gpl20 characterized by a Kd of about 1.3 nM 
to about 2.0 nM, and instructions for use. 

21. The diagnostic system of Claim 26, wherein the non-CD4 gpl20 receptor 

protein is affixed to a solid matrix to form a solid support 
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Figure 1C 
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Figure 2B 
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Figure 2C 
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x CTAAAGCAGGAGTTCTGGACACTGGGGGAGAGTGGGGTGAC 

42 ATGAGTGACTCCAAGGAACCAAGACTGCAGCAGCTGGGCCTCCTGGAGGAGGAACAGCTG 
1 MSDSKEPRLQQLGLLEEEQ-L * 

102 AGAGGCCTTGGATTCCGACAGACTCGAGGATACAAGAGCTTAGCAGGGTGTCTTGGCCAT 
21 RGLGFRQTRGYKSLAGCLGH 

162 GGTCCCCTGGTGCTGCAACTCCTCTCCTTC^CGCTCTTGGCTGGGCTCCTTGTCCAAGTG 
41 GPLVLQLLSF TL LAGLL V Q V 

222 TCCAAGGTCCCCAGCTCCATAAGTCAGGAACAATCCAGGCAAGACGCGATCTACCAGAAC 
61 SK V P S S I S Q E Q S RQ D A I Y Q N 

Rl * 

282 CTGACCCAGCTTAAAGCTGCAGTGGGTGAGCTCTCAGAGAAATCCAAGCTGCAGGAG ATC 
81 LTQ LKAAV G E LS E KSKLQ E I 

R2 

34 2 TACCAGGAGCTGACCCAGCTGAAGGCTGCAGTGGGTGAGCTTCCAG AG AAATCTAAGCTG 
101 Y Q E L T Q L K A A V G E L P E K S K L 

402 CAGGAGATCTACCAGGAGCTGACCCGGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAA 
121 Q E I Y Q E LT R LKAAV G E L P E K 

R3 

462 TCTAAGCTGCAGGAGATCTACCAGGAGCTGACCTGGCTGAAGGCTGCAGTGGGTGAGCTT 
141 SKL Q E I Y Q E LTWLK AAV G E L 

R4 

522 CCAGAGAAATCTAAGATGCAGGAGATCTACCAGGAGCTGACTCGGCTGAAGGCTGCAGTG 
161 PEK S KMQE I Y QELT RL K AAV 

R5 

582 GGTGAGCTTCCAGAGAAATCTAAGCAGCAGGAGATCTACCAGGAGCTGACCCGGCTGAAG 
181 G ELPEKSKQQEIYQELTRL K 

R6 

64 2 GCTGC^GTGGGTGAGCTTCCAGAGAAATCTAAGCAGCAGGAGATCTACCAGGAGCTGACC 
201 A A V G E L P E K S K Q Q E I Y Q E L T 

R7 

702 CGGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAATCTAAGCAGCAGGAGATCTACCAG 
221 RL K AAVGELPEKSK QQEIY Q 

R8 

762 GAGCTGACCCAGCTG AAGGCTGCAGTGGAACGCCTGTGCCACCCCTGTCCCTGGG AATGG 
241 E L T Q L K A A V E R L C H P C P W E W 

L 

822 ACATTCTTCCAAGGAAACTGTTACTTCATGTCTAACTCCCAGCGGAACTGGCACGACTCC 
261 TFFQ G N CY FMSNSQRN W H D S 



Figure 3A 
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882 
281 

942 
301 

1002 
321 

1062 
341 

1122 
361 

1182 
381 

1242 
401 
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ATCACCGCCTGCAAAGAAGTGGGGGCCCAGCTCGTCGTAATCAAAAGTGCTGAGGAGCAG 
ITAC KEVGAQLVVIKSA E EQ 

AACTTCCTACAGCTGCAGTCTTCCAGAAGTAACCGCTTCACCTGGATGGGACTTTCAGAT 
N F LQ LQ S S RS N RF TW MG L S D 

CTAAATCAGGAAGGCACGTGGCAATGGGTGGACGGCTCACCTCTGTTGCCCAGCTTCAAG 
L N Q E G T W Q W V D G S P L L P S r 

CAGTATTGGAACAGAGGAGAGCCCAACAACGTTGGGGAGGAAGACTGCGCGGAATTTAGT 
Q Y W N R G E P N N V G E E D C A E F S 

GGCAATGGCTGGAACGACGACAAATGTAATCTTGCCAAATTCTGGATCTGCAAAAAGTCC 
G NG W N D D K CN LAK FW I C K K S 

GCAGCCTCCTGCTCCAGGGATGAAGAACAGTTTCTTTCTCCAGCCCCTGCCACCCCAAAC 
A A SCSRDEEQFLSPAPAT P N 

' CCCCCTCCTGCGTAGCAGAACTTCACCCCCTTTTAAGCTACAGTTCCTTCTCTCCATCCT. 
P P P A *** 



1302 TCG ACCTTTAG 



Figure 3A(cont.) 
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